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PAPILLOMA VIRUS CAPSOMERE VACCINE 
FORMULATIONS AND METHODS OF USE 



FIELD OF THE INVENTION 

The present invention relates to vaccine formulations 
5 comprising papilloma virus proteins, either as fusion proteins, truncated 
proteins, or truncated fusion proteins. The invention further embraces 
methods for producing capsomeres of the formulations, as well as 
prophylactic and therapeutic methods for their use. 

BACKGROUND 

10 Infections with certain high-risk strains of genital papilloma 

viruses in humans (HPV) -- for example. HPV 16, 18, or 45 — are 
believed to be the main risk factor for the formation of malignant tumors of 
the anogenital tract. Of the possible malignancies, cervical carcinoma is by 
far the most frequent; according to an estimate by the World Health 

15 Organization (WHO), almost 500.000 new cases of the disease occur 

annually. Because of the frequency with which this pathology occurs, the 
connection between HPV infection and cervical carcinoma has been 
extensively examined, leading to numerous generalizations. 

For example, precursor lesions of cervical intraepithelial 

20 neoplasia (ON) are known to be caused by papilloma virus infections 

[Crum, New Eng. J. Med. 370:880-883 (1984)]. DNA from the genomes 
of certain HPV types, including for example, strains 16, 18, 33, 35, and 
45, have been detected in more than 95% of tumor biopsies from patients 
with this disorder, as well as in primary cell lines cultured from the 

25 tumors. Approximately 50 to 70% of the biopsied CIN tumor cells have 
been found to include DNA derived only from HPV 16. 

The protein products of the HPV 16 and HPV 18 early genes 
E6 and E7 have been detected in cervical carcinoma cell lines as well as in 
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by BPV VLPs. The mouse sera was therefore positive for neutralizing antibodies 
against the human VLPs and this differential neutralization was most likely the 
result of antibody specificity for epitopes against which the antibodies were raised. 

Numerous modifications and variations in the invention as set forth 
5 in the above illustrative examples are expected to occur to those skilled in the art. 
Consequently only such limitations as appear in the appended claims should be 
placed on the invention. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

<i) APPLICANT: 

(ii) TITLE OF INVENTION: Papilloma Virus Capsomere Vaccine Formulations 
and Methods of Use 

(iii) NUMBER OF SEQUENCES: 27 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Marshall, O'Toole, Gerstein, Murray & Borun 

(B) STREET: 23 3 South Waclcer Drive, 6300 Sears Tower 

(C) CITY: Chicago 

(D) STATE: Illinois 

(E) COUNTRY: United States of America 

(F) ZIP: 60606-6402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppv disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

<vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Williams Jr., Joseph A. 

(B) REGISTRATION NUMBER: 38,659 

(C) REFERENCE /DOCKET NUMBER: 27013/34028 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 312-474-6300 

(B) TELEFAX: 312-474-0448 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151B base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1..1518 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:l: 

ATG TCT CTT TGG CTG CCT AGT GAG GCC ACT GTC TAC TTG CCT CCT GTC 4 8 

Met Ser Leu Trp Leu Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val 
15 10 15 

CCA GTA TCT AAG GTT GTA AGC ACG GAT GAA TAT GTT GCA CGC ACA AAC 96 
Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ala Arg Thr Asn 
20 25 30 
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ATA TAT 
He Tyr 



TAT TTT 
Tyr Phe 
50 

GTA TCA 
Val Ser 
65 

AAT AAG 
Asn Lys 



TAT CAT 
Tyr His 
35 

CCT ATT 
Pro He 



GGA TTA 
Gly Leu 



TTT GGT 
Phe Gly 



CGG CTG 
Arg Leu 



TTA GGT 
Leu Gly 



ACA GAA 
Thr Glu 
130 

GAA TGT 
Glu Cys 
145 

TGC AAA 
Cys Lys 



GTT TGG 
Val Trp 
100 

GTG GGC 
Val Gly 
115 

AAT GCT 
Asn Ala 



ATA TCT 
He Ser 



CCA CCT 
Pro Pro 



AAT GTT 
Asn Val 



ACA GTT 
Thr Val 



GAC TTT 
Asp Phe 
210 

TGT ACA 
Cys Thr 
225 

CCA TAT 
Pro Tyr 



GCA GTA 
Ala Val 
180 



GCA GGA ACA TCC AGA CTA CTT GCA GIT 
Ala Gly Thr Ser Arg Leu Leu Ala Val 
40 45 

AAA AAA CCT AAC AAT AAC AAA ATA TTA 
Lys Lys Pro Asn Asn Asn Lys He Leu 
55 60 

CAA TAC AGG GTA TTT AGA ATA CAT TTA 
Gin Tyr Arg Val Phe Arg He His Leu 
70 75 

TTT CCT GAC ACC TCA TTT TAT AAT CCA 
Phe Pro Asp Thr Ser Phe Tyr Asn Pro 
85 90 

GCC TGT GTA GGT GTT GAG GTA GGT CGT 
Ala Cys Val Gly Val Glu Val Gly Arg 
105 

ATT AGT GGC CAT CCT TTA TTA AAT AAA 
He Ser Gly His Pro Leu Leu Asn Lys 
120 125 

AGT GCT TAT GCA GCA AAT GCA GGT GTG 
Ser Ala Tyr Ala Ala Asn Ala Gly Val 
135 140 

ATG GAT TAC AAA CAA ACA CAA TTG TGT 
Met Asp Tyr Lys Gin Thr Gin Leu Cys 
150 155 

ATA GGG GAA CAC TGG GGC AAA GGA TCC 
He Gly Glu His Trp Gly Lys Gly Ser 
165 170 

AAT CCA GGT GAT TGT CCA CCA TTA GAG 
Asn Pro Gly Asp Cys Pro Pro Leu Glu 
185 



ATT CAG 
He Gin 
195 

ACT ACA 
Thr Thr 



TCT ATT 
Ser He 



GTT AGA 
Val Arg 

GAC GAT 
Asp Asp 



GGC GAC 
Gly Asp 

CAT TTA 
His Leu 
260 

TTA TAC 
Leu Tvr 
275 



GAT GGT GAT ATG GTT GAT ACT GGC TTT 
Asp Gly Asp Met Val Asp Thr Gly Phe 
200 205 

TTA CAG GCT AAC AAA AGT GAA GTT CCA 
Leu Gin Ala Asn Lys Ser Glu Val Pro 
215 220 

TGC AAA TAT CCA GAT TAT ATT AAA ATG 
Cys Lys Tyr Pro Asp Tyr He Lys Met 
230 235 

AGC TTA TTT TTT TAT TTA CGA AGG GAA 
Ser Leu Phe Phe Tyr Leu Arg Arg Glu 
24 5 250 

TTT AAT AGG GCT GGT GCT GTT GGT GAA 
Phe Asn Arg Ala Gly Ala Val Gly Glu 
265 

ATT AAA GGC TCT GGG TCT ACT GCA AAT 
He Lys Gly Ser Gly Ser Thr Ala Asn 
280 285 



GGA CAT CCC 
Gly His Pro 



GTT CCT AAA 
Val Pro Lys 



CCT GAC CCC 
Pro Asp Pro 
80 

GAT ACA CAG 
Asp Thr Gin 
95 

GGT CAG CCA 
Gly Gin Pro 
110 

TTG GAT GAC 
Leu Asp Asp 



GAT AAT AGA 
Asp Asn Arg 



TTA ATT GGT 
Leu He Gly 
160 

CCA TGT ACC 
Pro Cys Thr 
175 

TTA ATA AAC 
Leu He Asn 
190 

GGT GCT ATG 
Gly Ala Met 



CTG GAT ATT 
Leu Asp He 



GTG TCA GAA 
Val Ser Glu 
240 

CAA ATG TTT 
Gin Met Phe 
255 

AAT GTA CCA 
Asn Val Pro 
270 

TTA GCC AGT 
Leu Ala Ser 



144 



19: 



240 



288 



336 



384 



432 



480 



528 



576 



624 



67: 



720 



768 



816 



864 
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TCA AAT TAT TTT CCT ACA CCT AGT GGT TCT ATG GIT ACC TCT GAT GCC 912 

Ser Asn Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp Ala 

290 295 300 

CAA ATA TTC AAT AAA CCT TAT TGG TTA CAA CGA GCA CAG GGC CAC AAT 960 
Gin He Phe Asn Lys Pro Tyr Trp Leu Gin Arg Ala Gin Gly His Asn 
305 310 315 320 

AAT GGC ATT TGT TGG GGT AAC CAA CTA TTT GTT ACT GTT GOT GAT ACT 1008 
Asn Gly He Cys Trp Gly Asn Gin Leu Phe Val Thr Val Val Asp Thr 
325 330 335 

ACA CGC AGT ACA AAT ATG TCA TTA TGT GCT GCC ATA TCT ACT TCA GAA 1056 
Thr Arg Ser Thr Asn Met Ser Leu Cys Ala Ala He Ser Thr Ser Glu 
340 345 350 

ACT ACA TAT AAA AAT ACT AAC TTT AAG GAG TAC CTA CGA CAT GGG GAG 1104 
Thr Thr Tyr Lys Asn Thr Asn Phe Lys Glu Tyr Leu Arg His Gly Glu 
355 360 365 

GAA TAT GAT TTA CAG TTT ATT TTT CAA CTG TGC AAA ATA ACC TTA ACT 1152 
Glu Tyr Asp Leu Gin Phe He Phe Gin Leu Cys Lys He Thr Leu Thr 
370 375 380 

GCA GAC GTT ATG ACA TAC ATA CAT TCT ATG AAT TCC ACT ATT TTG GAG 12 00 

Ala Asp Val Met Thr Tyr He His Ser Met Asn Ser Thr lie Leu Glu 
385 390 395 400 

GAC TGG AAT TTT GGT CTA CAA CCT CCC CCA GGA GGC ACA CTA GAA GAT 124 8 

Asp Trp Asn Phe Gly Leu Gin Pro ,Pro Pro Gly Gly Thr Leu Glu Asp 
405 410 415 

ACT TAT AGG TTT GTA ACC TCC CAG GCA ATT GCT TGT CAA AAA CAT ACA 129 6 

Thr Tyr Arg Phe Val Thr Ser Gin Ala He Ala Cys Gin Lys His Thr 
420 425 430 

CCT CCA GCA CCT AAA GAA GAT CCC CTT AAA AAA TAC ACT TTT TGG GAA 1344 
Pro Pro Ala Pro Lys Glu Asp Pro Leu Lys Lys Tyr Thr Phe Trp Glu 
435 440 445 

GTA AAT TTA AAG GAA AAG TTT TCT GCA GAC CTA GAT CAG TTT CCT TTA 1392 
Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gin Phe Pro Leu 
450 455 460 

GGA CGC AAA TTT TTA CTA CAA GCA GGA TTG AAG GCC AAA CCA AAA TTT . 1440 

Gly Arg Lys Phe Leu Leu Gin Ala Gly Leu Lys Ala Lys Pro Lys Phe 
465 470 475 480 

ACA TTA GGA AAA CGA AAA GCT ACA CCC ACC ACC TCA TCT ACC TCT ACA 14 88 

Thr Leu Gly Lys Arg Lys Ala Thr Pro Thr Thr Ser Ser Thr Ser Thr 
485 490 495 

ACT GCT AAA CGC AAA AAA CGT AAG CTG TAA 1518 
Thr Ala Lys Arg Lys Lys Arg Lys Leu * 
500 505 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 506 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
Met Ser Leu Trp Leu Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val 

Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ala Arg Thr Asn 
20 25 3 Z 

lie Tyr Tyr His Ala Gly Thr Ser Arg Leu Leu Ala Val Gly His Pro 

Jb 40 45 

Tyr Phe Pro lie Lys Lys Pro Asn Asn Asn Lys lie Leu Val Pro Lys 

Val Ser Gly Leu Gin Tyr Arg Val Phe Arg He His Leu Pro Asp Pro 
65 70 75 80 

Asn Lys . Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr Gin 
85 9° 95 

Arg Leu Val Trp Ala Cys Val Gly Val Glu Val Gly Arg Gly Gin Pro 
100 105 110 

Leu Gly Val Gly lie Ser Gly His Pro Leu Leu Asn Lys Leu Asp Asp 
115 120 125 F . 

Thr Glu Asn Ala Ser Ala Tyr Ala Ala Asn Ala Gly Val Asp Asn Arg 
1JU 135 140 

Glu Cys lie Ser Met Asp Tyr Lys Gin Thr Gin Leu Cys Leu He Gly 
145 150 155 16 o 

Cys Lys Pro Pro lie Gly Glu His Trp Gly Lys Gly Ser Pro Cys Thr 
165 170 175 

Asn Val Ala Val Asn Pro Gly Asp Cys Pro Pro Leu Glu Leu He Asn 
180 185 190 

Thr Val lie Gin Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met 
195 200 205 

Asp Phe Thr Thr Leu Gin Ala Asn Lys Ser Glu Val Pro Leu Asp lie 
210 215 220 

Cys Thr Ser lie Cys Lys Tyr Pro Asp Tyr lie Lys Met Val Ser Glu 
22 ~ 230 235 240 

Pro Tyr Gly Asp Ser Leu Phe Phe Tyr Leu Arg Arg Glu Gin Met Phe 
245 250 255 

Val Arg His Leu Phe Asn Arg Ala Gly Ala Val Gly Glu Asn Val Pro 
260 265 270 

Asp Asp Leu Tyr lie Lys Gly Ser Gly Ser Thr Ala Asn Leu Ala Ser 
275 280 285 

Ser Asn Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp Ala 
290 295 300 
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Gln lie 
305 

Asn Gly 

Thr Arg 

Thr Thr 

Glu Tyr 
3*70 

Ala Asp 
385 

Asp Trp 

Thr Tyr 

Pro Pro 

Val Asn 
450 

Gly Arg 
465 

Thr Leu 
Thr Ala 



Phe Asn Lys 

lie Cys Trp 
325 

Ser Thr Asn 
340 

Tyr Lys Asn 
355 

Asp Leu Gin 

Val Met Thr 

Asn Phe Gly 
405 

Arg Phe Val 
420 

Ala Pro Lys 
435 

Leu Lys Glu 
Lys Phe Leu 



Pro Tyr Trp Leu 
310 

Gly Asn Gin Leu 



Met Ser 



Thr Asn 



Phe He 
375 

Tyr He 
390 



Leu Cys 
345 

Phe Lys 
360 

Phe Gin 



His Ser 



Leu Gin Pro Pro 



Thr Ser 



Glu Asp 



Lys Phe 
455 

Leu Gin 
470 



Gin Ala 
425 

Pro Leu 
440 

Ser Ala 
Ala Gly 



Gly Lys Arg 
485 

Lys Arg Lvs 
500 



Lvs Ala Thr Pro 



Gin Arg Ala Gin Gly His Asn 
315 320 

Phe Val Thr Val Val Asp Thr 
330 335 

Ala Ala He Ser Thr Ser Glu 
350 

Glu Tyr Leu Arg His Gly Glu 
365 

Leu Cys Lvs He Thr Leu Thr 
380 

Met Asn Ser Thr He Leu Glu 
395 400 

Pro Gly Gly Thr Leu Glu Asp 
410 415 

He Ala Cys Gin Lys His Thr 
430 

Lys Lys Tyr Thr Phe Trp Glu 
445 

Asp Leu Asp Gin Phe Pro Leu 
460 

Leu Lys Ala Lys Pro Lys Phe 
475 480 

Thr Thr Ser Ser Thr Ser Thr 
490 495 



Lys Arq 



Lys Leu 
505 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . .297 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

ATG CAT GGA GAT ACA CCT ACA TTG CAT GAA TAT ATG TTA GAT TTG CAA 4 8 

Met His Gly Asp Thr Pro Thr Leu His Glu Tyr Met Leu Asp Leu Gin 
15 10 15 



CCA GAG ACA ACT GAT CTC TAC TGT TAT GAG CAA TTA AAT GAC AGC TCA 
Pro Glu Thr Thr Asp Leu Tyr Cys Tyr Glu Gin Leu Asn Aso Ser Ser 
20 25 30 



96 
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GAG GAG GAG GAT GAA ATA GAT GGT CCA GCT GGA CAA GCA GAA CCG GAC 144 
Glu Glu Glu Asp Glu He Asp Gly Pro Ala Gly Gin Ala Glu Pro Asp 
35 40 45 

AGA GCC CAT TAC AAT ATT GTA ACC TTT TGT TGC AAG TGT GAC TCT ACG 192 
Arg Ala His Tyr Asn He Val Thr Phe Cys Cys Lys Cys Asp Ser Thr 
so 55 go 

CTT CGG TTG TGC GTA CAA AGC ACA CAC GTA GAC ATT CGT ACT TTG GAA 24 0 

Leu Arg Leu Cys Val Gin Ser Thr His Val Asp He Arg Thr Leu Glu 
65 7 0 75 80 

GAC CTG TTA ATG GGC ACA CTA GGA ATT GTG TGC CCC ATC TGT TCT CAG 2 88 

Asp Leu Leu Met Gly Thr Leu Gly He Val Cys Pro He Cys Ser Gin 
85 so 95 

AAA CCA TAA 
Lys Pro * 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE : protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met His Gly Asp Thr Pro Thr Leu His Glu Tyr Met Leu Asp Leu Gin 
15 10 15 

Pro Glu Thr Thr Asp Leu Tyr Cys Tyr Glu Gin Leu Asn Asp Ser Ser 
20 25 30 

Glu Glu Glu Asp Glu He Asp Gly Pro Ala Gly Gin Ala Glu Pro Asp 
35 40 45 

Arg Ala His Tyr Asn He Val Thr Phe Cys Cys Lys Cys Asp Ser Thr 
50 55 60 

Leu Arg Leu Cys Val Gin Ser Thr His Val Asp He Arg Thr Leu Glu 
65 7 0 75 80 

Asp Leu Leu Met Gly Thr Leu Gly He Val Cys Pro He Cys Ser Gin 
85 90 95 

Lys Pro * 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
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CCCCGATATC GCCTTTAATG TATAAATCGT CTGG 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:€: 
CCCCGATATC TCAAATTATT TTCCTACACC TAGTG 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
AAAGATATCT TGTAGTAAAA ATTTGCGTCC TAAAGGAAAC 
(2) INFORMATION FOR SEQ ID NO: 8: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



<xii SEQUENCE DESCRIPTION : SEQ ID NO: 8: 
AAAGATATCT AATCTACCTC TACAACTGCT AAACGCAAAA AACG 
(2) INFORMATION FOR -SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

AAAAGATATC ATGCATGGAG ATACACCTAC ATTGC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs 
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(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TTTTGATATC GGCTCTGTCC GGTTCTGCTT GTCC 
(2) INFORMATION FOR SEQ ID NO: 11: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TTTTGATATC CTTGCAACAA AAGGTTACAA TATTGTAATG GGCC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xii SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AAAAGATATC TGGTTTCTGA GAACAGATGG GGCAC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TTTTGATATC GATTATGAGC AATTAAATGA CAGCTCAG 
(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
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TTTTGATATC GTCTACGTGT GTGCTTTGTA CGCAC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TTTATCGATA TCGGTCCAGC TGGACAAGCA GAACCGGAC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16: 
TTTTGATATC GATGCCCATT ACAATATTGT AACCTTTTG 
(2) INFORMATION FOR SEQ ID NO: 17; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 . .294 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1*7 : 

ATG AGT CTT CTA ACC GAG GTC GAA ACG CTT ACC AGA AAC GGA TGG GAG 
Met Ser Leu Leu Thr Glu Val Glu Thr Leu Thr Arg Asn Gly Trp Glu 
1 5 10 15 

TGC AAA TGC AGC GAT TCA AGT GAT CCT CTC ATT ATC GCA GCG AGT ATC 
Cys Lys Cys Ser Asp Ser Ser Asp Pro Leu lie lie Ala Ala Ser lie 
20 25 30 

i 

ATT GGG ATC TTG CAC TTG ATA TTG TGG ATT TTT TAT CGT CTT TTC TTC 
lie Gly lie Leu His Leu lie Leu Trp lie Phe Tyr Arg Leu Phe Phe 
35 40 45 

AAA TGC ATT TAT CGT CGC CTT AAA TAC GGT TTG AAA AGA GGG CCT TCT 
Lys Cys lie Tyr Arg Arg Leu Lys Tyr Gly Leu Lys Arg Gly Pro Ser 
50 55 60 



WO 99/18220 



PCT/US98/20965 



-37- 

ACG GAA GGA GCG CCT GAG TCT ATG AGG GAA GAA TAT CGG CAG GAA CAG 240 
Thr Glu Gly Ala Pro Glu Ser Met Arg Glu Glu Tyr Arg Gin Glu Gin 
65 70 75 80 

CAG AGT GCT GTG GAT GTT GAC GAT GTT CAT TTT GTC AAC ATA GAG CTG 288 
Gin Ser Ala Val Asp Val Asp Asp Val His Phe Val Asn He Glu Leu 
85 90 95 



GAG TAA 
Glu * 



294 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

< i i ) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l8: 

Met Ser Leu Leu Thr Glu Val Glu Thr Leu Thr Arg Asn Gly Trp Glu 
15 10 is 

Cys Lys Cys Ser Asp Ser Ser Asp Pro Leu He He Ala Ala Ser He 
20 25 30 

He Gly He Leu His Leu He Leu Trp He Phe Tyr Arg Leu Phe Phe 
35 40 45 

Lys Cys He Tyr Arg Arg Leu Lys Tyr Gly Leu Lys Arg Gly Pro Ser 
50 55 60 

Thr Glu Gly Ala Pro Glu Ser Met Arg Glu Glu Tyr Arg Gin Glu Gin 
65 7 0 75 80 

Gin Ser Ala Val Asp Val Asp Asp Val His Phe Val Asn He Glu Leu 
85 90 95 

Glu * 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : sincle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TTTTGATATC GATATGGAAT GGCTAAAGAC AAGACCAATC 40 
(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
TTTTGATATC GTTGTTTGGA TCCCCATTCC CATTG 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GTTATGACAT ACATACATTC TATG 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CCATGCATTC CTGCTTGTAG TAAAAATTTG CGTCC 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 23: 
CTACAAGCAG GAATGCATGG AGATACACC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sincrle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CATCTGAAGC TTAGTAATGG GCTCTGTCCG GTTCTG 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

CATCTGAAGC TTATCAATAT TGTAATGGGC TCTGTCCG 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 54 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
CATCTGAAGC TTACTTGCAA CAAAAGGTTA CAATATTGTA ATGGGCTCTG TCCG 
£2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
CATCTGAAGC TTAAAGCGTA GAGTCACACT TGCAACAAAA GGTTACAATA TTGTAATGGG 
CTCTGTCCG 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 28: 
CATCTGAAGC TTATTGTACG CACAACCGAA GCGTAGAGTC ACACTTG 
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WHAT IS CLAIMED IS: 

1 . A vaccine formulation comprising a human papilloma 
virus capsomere, said capsomere comprising a fusion protein comprising a 
human papilloma virus LI protein adjacent amino acid residues from a 
second protein. 

2. A vaccine formulation comprising a human papilloma 
virus capsomere. said capsomere comprising a truncated human papilloma 
vims LI protein having a deletion of one or more amino acid residues 
necessary for formation of a virus-like panicle. 

3. The vaccine formulation of claim 2 wherein said 
capsomere comprises a fusion protein comprising a truncated human 
papilloma virus LI protein adjacent amino acid residues from a second 
protein. 

4. The vaccine formulation of any one of claims 1,2, or 
3 wherein the LI protein is encoded in the genome of a human papilloma 
vims selected from the group consisting of HPV6, HPV11, HPV16, 
HPV18. HPV33, HPV35. and HPV45. 

5. The vaccine formulation of claim 4 wherein the 
papilloma virus is HVPI6. 

6. The vaccine formulation of any one of claims 2, 3, or 
5 wherein carboxy terminal amino acid residues are deleted from the LI 
protein. 

7. The vaccine formulation of claim 6 wherein 1 to 34 
carboxy terminal amino acid residues are deleted from the LI protein. 
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8. The vaccine formulation of claim 7 wherein 34 
carboxy terminal amino acid residues are deleted from the LI protein. 

9. The vaccine formulation of any one of claims 2, 3, or 
5 wherein amino terminal amino acid residues are deleted from the LI 
protein. 

10. The vaccine formulation of any one of claims 2, 3, or 
5 wherein internal amino acid residues are deleted from the LI protein. 

11. The vaccine formulation of claim 10 wherein the 
amino acid residues deleted from the L! protein comprise a nuclear 
localization signal. 

12. The vaccine formulation of claims 2 or 3 wherein the 
amino acids residues from the second protein are derived from an HPV 
protein. 

13. The vaccine formulation of claim 12 wherein the 
HPV protein is an early HPV protein. 

14. The vaccine u>rmulation of claim 12 wherein the 
early HPV protein is selected from the group consisting of El, E2, E3 ? E4, 
E5. E6. and E7. 



15. A method of treating an individual infected with an 
HPV virus comprising the step of administering to a patient in need thereof 
an amount of the vaccine formulation of claims 1, 2, 3, 5, 7, 8, 11, 13 or 
14 effective to reduce the level of HPV infection. 
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16. A method for preventing papilloma vims infection 
comprising the step of administering to an individual susceptible thereto an 
amount of the vaccine formulation of claims 1, 2, 3, 5, 7, 8, 11, 13 or 14 
effective to inhibit HPV infection. 



INTERNATIONAL SEARCH REPORT 



Intel onal Application No 

PCT/US 98/20965 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 C12N15/62 A61K39/12 



According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 C07K C12N A61K 



Documentation searched other than minimum aocumemation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category : Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to daim No. 



MULLER M ET AL . : "Chimeric 
papillomavirus-like particles" 
VIROLOGY, 

vol. 234, no. 1, 21 July 1997, pages 

93-111, XP002091857 

ORLANDO US 

see the whole document 

DE 44 35 907 A (GISSMANN L;ZH0U J; MULLER 

M) 11 April 1996 

see the whole document 

-/- 



1-8, 
10-16 



1-5, 
10-16 



[ X J Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



" Special categories of cited documents : 

"A" document defining the general state of the art which is not 
considered to be of particular relevance 

"E" earlier document but published on or after the international 
tiling date 

"L* document which may throw doubts on priority claim(s) or 
wmch is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use. exhibition or 
other means 

"P" document published prior to the international King date but 
later than the priority date claimed 



T later document published after the international tiling date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X" document of particular relevance; the claimed Invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document ot particular relevance: the claimed invention 

cannot be cons we red to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the an. 

"4" document member ot the same patent family 



Date ot the actual completion of the international search 



1 February 1999 



Date ot mailing ot the international search report 

16/02/1999 



Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL - 2280 HV Rijswijk 
Tel. (+31-70) 340-2040. Tx. 31 651 epo nl. 
Fax: (+31-70) 340-3016 



Authorized officer 



Cupido, M 



Form PCT/ISA/2tO(secona shMt) (July 1992) 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 



Inte onal Application No 

PCT/US 98/20965 



C.(Contlnuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category - Citation of document, with indication,wnere appropriate, ot me relevant passages 



Relevant to claim No. 



LI H ET AL.: "Expression of the human 

papillomavirus type 11 LI capsid protein 

in Escherichia coli" 

JOURNAL OF VIROLOGY, 

vol. 71, no. 4, April 1997, pages 

2988-2995, XP002091858 

AMERICAN SOCIETY FOR MICROBIOLOGY US 

see figure 7 

PAINTSIL J ET AL.: "Carboxy terminus of 
bovine pappilomavirus type-1 LI protein is 
not required for capsid formation" 
VIROLOGY, 

vol. 223, no. 1, 1 September 1996, pages 
238-244, XP002091859 
ORLANDO US 
see figure 1 

ROSE R C ET AL: "SEROLOGICAL 

DIFFERENTIATION OF HUMAN PAPILLOMAVIRUS 

TYPES 11, 16 AND 18 USING RECOMBINANT 

VIRUS-LIKE PARTICLES" 

JOURNAL OF GENERAL VIROLOGY, 

vol. 75. no. 9, September 1904, pages 

2445-2449, XP000604635 

WO 96 11274 A (US DEPARTMENT OF HEALTH) 
18 April 1996 
see examples 1-7 



1-4,9 



1-16 



1-16 



1-16 



Form PCT/ISA/210 (continuation ot sacond sheet) (July 1992) 



page 2 of 2 



INTERNATIONAL SEARCH REPORT 



(-national application No. 
PCT/ US 98/20965 



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 

This international Search Repon has not been established in respect ot certain claims under Article 1 7(2)(a) for the following reasons: 

1. | X j Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 

Remark: Although claims 15 and 16 

are directed to a method of treatment of the human or animal 
body, the search has been carried out and based on the alleged 
effects of the vaccine formulation. 

2. I | Claims Nos.: 

because they relate to parts ol the International Application that do not comply with the prescribed retirements tosuch 
an extent that no meaningful International Search can be carried out. specifically: 



3. | 1 Claims Nos.: 

because they are aepenaent claims ana are not dratted in accordance with the second and third sentences ot Rule6.4(a). 

Box ii Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 



□ As alt required additional search fees were timely paid by the applicant, this international Search Report covers all 
searchable claims. 



2. Q As all searchable claims could be searched without effort justifying an additional fee. this Authority did not invite payment 
of any additional fee. 



3. I I As only some of the required additional search fees were timelv paid by the applicant, this International Search Report 
1 — 1 covers only those claims tor which fees were paid, specifically claims Nos.: 



4. Q No required additional search fees were timely paid by the applicant. Consequently, this international Search Report is 
restricted to the invention tirst mentioned in the claims: it is covered by claims Nos.: 



Remark on Protest Q The additional search tees were accompanied by the applicants protest. 

[ [ No protest accompanied the payment of additional search lees. 



Form PCT/lSA/210 (continuation of first sheet (1))(July 1992) 



INTERNATIONAL SEARCH REPORT 

, (formation on patent (amity members 



Inter wal Application No 

PCT/US 98/20965 



Patent document 
cited in search report 



Publication 
date 



Patent tamily 
member(s) 



Publication 
date 



DE 4435907 



11-04-1996 



AU 
CA 
WO 
EP 



4270196 A 
2202090 A 
9611272 A 
0809700 A 



02- 05-1996 
18-04-1996 
18-04-1996 

03- 12-1997 



WO 9611274 


A 


18-04-1996 


US 


5618536 A 


08-04-1997 






AU 


3828495 A 


02-05-1996 








EP 


0789766 A 


20-08-1997 








JP 


10506796 T 


07-07-1998 








US 


5855891 A 


05-01-1999 



Fonn PCT/ISA/210 tpawm tamily annex) (July 1992) 



THIS PAGE BLANK (uspto) 



